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Self-checking logic 
flags errors 
as they happen 

Parag K Lala, North Carolina Agricultural and Technical State University 



Off-line testing can't detect the transient or 
intermittent faults that are emerging as the 
dominant failure mode in VLSI circuits. But 
self-checking circuits can continuously moni- 
tor circuits during normal operation and de- 
tect these faults. 

To ensure reliability, hardware must be testable from 
the printed-circuit board level down to the chip level. 
To varying degrees, designers use techniques such as 
built-in self-test (BIST) to improve the testability of 
their products. However, BIST and other off-line test 
techniques are not sufficient. Failures often occur in- 
termittently while the hardware is on line, thus circum- 
venting these techniques. You need to incorporate self- 
checking circuits in your design if you want to catch 
logic failures while the system is on line. 
Self-checking circuits have two significant advantages: 

• They detect errors caused by transient or inter- 
mittent faults. 

• Their fault latency is minimal; that is, the circuits 
detect errors immediately. 

Researchers have worked considerably on making 
logic blocks such as adders, multipliers, and PL A 
device totally self-checking. They have also proposed 
a variety of checkers (which are themselves self- 
checking) for various error-detecting codes. Unfortun- 
ately, very little exists for making logic circuits in 
general' self-checking. This article specifically discusses 
self-checking combinational-logic and PLA-device de- 
signs. 

Self-checking circuits comprise a functional circuit 



having coded outputs along with a checking circuit to 
observe the output of the functional circuit. Fig 1 
shows the model of a self-checking circuit. The func- 
tional circuit's outputs are error-detecting code words 
as well as being your desired logic outputs. 

The checker circuit checks the validity of the output 
code words. It maps code-word inputs to code-word 
outputs and noncode-word inputs into noncode-word 
outputs. Thus, by observing the output of the checking 
circuit, you can determine if any fault exists in either 
the functional circuit or the checking circuit itself. 

The following two definitions are the basis for self- 
checking logic design: 

• A circuit is self-testing if, for every fault from a 
prescribed set of faults, it produces a noncode word 
at the output for at least one input code word. 

• A circuit is fault secure if, for any fault from a pre- 
scribed set, it never produces a valid code word at 
the output for any input code word. "' " 

You can design any one of a variety of error- 
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Fig 1— Self-checking circuits comprise a functional circuit that has 
coded outputs along with a checking circuit that observes' the 
output of the functional circuit. By design, the functional circuit's 
outputs form an error-detecting code word. : ir :>*? 
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/here p is the number of check bits. In this article, 
= 3. The main features of MOD3 residue code are: , 
The check bits in a code word are only two bits. 
The residue of a bit in a code word is 1 if the bit 
occupies an even-bit position and 2 if the bit occupies 
an odd-bit position. 

To illustrate, assume that the output pattern of a 
ombinational circuit is 

Calculate the MOD3 residue of the pattern as 

ODD EVEN ODD EVEN 
MOD3[MOD3(f 3 ) + MOD3(f 2 ) + MOD3(f 1 ) + MOD3(f )] 
= MOD3(2 + + 2 + l) = 2. 

Suppose an output bit changes its value because of a 
fault in the circuit. This change will increment or decre- 
ment the residue of the original output pattern by 1 
or 2 depending on whether the erroneous bit occupies 
an even-bit or an odd-bit position, respectively. In 
other words, in the presence of a single-bit error, the 
residue will always be different from the calculated 
fault-free residue, thus signaling the error. For exam- 
ple, if bit f x in the above output pattern changes to 0, 
the new residue will be 

MOD3(2+0+0 + 1)=0. 

Self-checking combinational logic design 

To understand, how you can use MOD3 residue code 
in self-checking combinational circuits, consider such 
a circuit and assume that, for a particular input pat- 
tern, Z and Z' represent the error-free and erroneous 
output patterns, respectively. Further assume that 
Z~Z'= -w. If the MOD3 representation of w is not 
equal to 0, you can assume that a single-bit error or a 
multibit error is present in the output pattern. 



However, a multibit error in the output pattern can 
make w a multiple of 3, making MOD3(w) equal to 
and hence masking the presence of the error. As dis- 
cussed previously, for the MOD3 residue-coding 
scheme, an even bit contributes a 1 to the residue, 
whereas an odd bit contributes a 2. Thus, only an error 
combination of an even and an odd bit can make 
MOD3(w) equal to 0. 

To illustrate, assume that the error-free output of a 
combinational circuit for a particular input pattern is 

%' ft 4 ^2 *i V 
10 110 

Suppose a fault in the circuit affects the output pattern 
so that both f 2 and fi change from to I, that is, a 
"unidirectional" error occurs, which yields 

4 f» 4 h fi fo- 

■ 1 

Thus, because w = MOD3(38-32) = 0, the error in the out- 
put pattern is undetected. However, if you properly dis- 
tribute outputs, this error becomes detectable (Ref 5). 

The redistribution consists of changing the weights 
of certain bits and calculating the residue of an output 
pattern. For example, if you swap the positions of fo 
and f x and fo have weight of 1 and 2, respectively), 
the error-free output pattern and the corresponding 
residue are 

4 ft 4 h fi ^o 

1 1 l = MOD3(37) = l 

In the presence of the same unidirectional error, the 
output pattern and the residue become 

4 fi f3 f2 4 fo 

1 = MOD3(32) = 2 
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Fig 4— A PLD (a) can have failure modes other than "stuck-at" faults. Potential crosspoint failures (b) indude both having a cross point 
ne should not be or not having a connection where one should be. 
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the XOR lines e and e, as well as the residue lines r and r, (b) to be totally self-thecking. 



The arbitrary PLD in Fig 5 illustrates the self- 
checking technique. If the PLD satisfies the first condi- 
tion specified above, the PLD will enable only one 
product term for an input pattern. Thus, in the pres- 
ence of a fault, a desired product term may not acti- 
vate, or an additional product term may activate. 

However, by incorporating two XOR "trees" (Fig 5b), 
you can not only detect all single-bit errors at the out- 
put, but also detect the majority of multibit errors as 
well (Ref 4). Alternate product terms in the PLD are 
inputs to each XOR tree. In Fig 5b, if only one product 
term is active, the output of the XOR trees, e and e lf 
will be either 01 or 10. However, if none of the product 
terms is enabled, e and ej will be 00. Or if an undesired 
product term is active in addition to the desired one, 
e and e! will be 11. In either fault case, e and e! are 
not a l-out-of-2 code, which indicates a fault. 

The output of the PLD uses the MOD3 residue code. 
Hence, you need two extra sum lines, r and r^ The 
MOD3 residue of the PLD's output lines, X and Y, 
will appear on the output lines r and r^ For example, 
if XY = 11 (both X and Y lines for a particular input 
pattern , are crossed), the corresponding residue will 
be MOD3(3) = and hence no crosses at the added sum 
lines will occur. In other words, ^ = 00. 

In summary, this technique allows you to detect single 
faults in product terms by using XOR trees, and in sum 
lines from the MOD3 residue codes. Therefore, you 
can detect all single faults in a PLD on line. The tech- 
niques will also catch a significant portion of multibit ' 
errors. SHU 
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